AIbase
Product LibraryTool NavigationMCP

Search AI Products and News

  • AI News
  • AI Tools
2025-07-03 09:38:06.AIbase

Scientists Have Something to Say! SciArena Platform Launches Multi-Dimensional Evaluation of Large Language Models' Scientific Performance

2025-06-20 08:57:11.AIbase

Lower video cost! MiniMax Hailuo 02 outperforms Google Veo 3 in user benchmark tests

2025-06-17 15:35:29.AIbase

​Moon's Dark Side Releases New Open Source Model Kimi-Dev-72B, Breaking Programming Benchmark Records

2025-06-17 11:12:21.AIbase

In-Depth Review of Body Type Calculator: Is the AI Helper for Scientifically Shaping the Perfect Physique Actually Reliable?

2025-06-12 11:39:58.AIbase

Xpeng G7 Global Launch: Becomes a New Benchmark for L3-Level AI Cars with Self-Developed Turing Chip!

2025-06-09 08:59:39.AIbase

New King of Long Text Understanding? Gemini2.5Pro Beats o3 and Leads Fiction.Live Benchmark

2025-06-04 10:13:15.AIbase

Stanford's Latest Evaluation: DeepSeek R1 Medical AI Model Outperforms Google and OpenAI with High Scores

2025-06-04 09:25:31.AIbase

Fish Audio Releases OpenAudio S1: A New Benchmark for AI Voice with Professional Dubbing Actor Quality

2025-05-29 11:07:22.AIbase

Google's Big Move! Open Source Evaluation Framework LMEval Launched, Making AI Model Comparisons More Transparent

2025-05-28 11:36:00.AIbase

Evaluation of Multi-modal Large Model Visual Reasoning Capability: o3 Scores Only 25.8%

2025-05-27 15:43:32.AIbase

Peking University Team First Systematically Evaluates the Psychological Characteristics of Large Language Models, Promoting New Standards for AI Evaluation

2025-05-27 11:21:46.AIbase

OpenAI Releases Healthcare AI Evaluation Benchmark Dataset HealthBench

2025-05-26 13:47:15.AIbase

Sequoia China Launches New AI Benchmark Tool to Help Establish New Standards for Intelligent Assessment

2025-05-12 08:58:55.AIbase

First Intelligent Document Processing Benchmark Released: Gemini Leads but Shortcomings Remain, Multimodal AI Faces Real Challenges

2025-05-10 10:15:47.AIbase

UGMathBench Dynamic Benchmark Dataset Released to Evaluate Language Models' Mathematical Reasoning Ability

2025-04-27 09:04:54.AIbase

Moonshot AI Unveils Kimi-Audio: A New Benchmark for Open-Source Audio Foundation Models

2025-04-27 08:53:10.AIbase

Step1X-Edit: A New Benchmark in Open-Source Image Editing, Rivaling Closed-Source Models like GPT-4o

2025-04-24 09:05:24.AIbase

AWS Releases SWE-PolyBench: A New Open-Source Benchmark for Evaluating AI Programming Assistants

2025-04-18 10:53:12.AIbase

LMArena Officially Launches, Dedicated to Providing a Neutral AI Evaluation Platform

2025-04-16 11:24:23.AIbase

OpenAI Acquires Context.ai Team to Enhance AI Model Evaluation